Comparison of discriminative training methods for speaker verification
نویسندگان
چکیده
The maximum likelihood estimation (MLE) and Bayesian maximum a-posteriori (MAP) adaptation methods for Gaussian mixture models (GMM) have proven to be effective and efficient for speaker verification, even though each speaker model is trained using only his own training utterances. Discriminative criteria aim at increasing discriminability by using out-of-class data. In this paper, we consider the speaker verification task using three discriminative training methods to compare performance. Comparisons are discussed for the maximum mutual information (MMI), minimum classification error (MCE) and figure of merit (FOM) criteria. Experiments on the 1996 NIST speaker recognition evaluation data set show that FOM training method outperforms the other two methods for speaker verification in terms of system performance. Meanwhile, logistic regression is investigated and successfully employed as a discriminative scorenormalization technique.
منابع مشابه
Discriminative PLDA training with application-specific loss functions for speaker verification
Speaker verification systems are usually evaluated by a weighted average of its false acceptance (FA) rate and false rejection (FR) rate. The weights are known as the operating point (OP) and depend on the applications. Recent researches suggest that, for the purpose of score calibration of speaker verification systems, it is beneficial to let discriminative training emphasize on the operating ...
متن کاملDiscriminative Training of Minimum Cost Speaker Verification Systems
This paper presents a new training procedure for speaker verification systems. The procedure extends previous speaker verification work by (1) developing a new discriminative a posteriori-based training algorithm, and (2) extending the algorithm to directly optimize speaker verification performance. The key features of the new training algorithm include leveraging current state of the art techn...
متن کاملDETAC: a discriminative criterion for speaker verification
This paper introduces a general criterion applicable to discriminative training of detection systems, and discusses its particular implementation in GMM-based text-independent speaker verification. Based on an analysis of the detection error trade-off curve of a baseline system, we argue that the new criterion extends several conventional methods such as the maximum posterior training by logist...
متن کاملUnsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
This paper presents, for the first time, unsupervised discriminative training of probabilistic linear discriminant analysis (unsupervised DT-PLDA). While discriminative training avoids the problem of generative training based on probabilistic model assumptions that often do not agree with actual data, it has been difficult to apply it to unsupervised scenarios because it can fit data with almos...
متن کاملConstrained discriminative speaker verification specific to normalized i-vectors
This paper focuses on discriminative trainings (DT) applied to ivectors after Gaussian probabilistic linear discriminant analysis (PLDA). If DT has been successfully used with non-normalized vectors, this technique struggles to improve speaker detection when i-vectors have been first normalized, whereas the latter option has proven to achieve best performance in speaker verification. We propose...
متن کامل